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SUMMARY 

A highly systematic, non-cloning method of distinguishing and isolating every fragment in a class-IIS or interrupted 
palindrome restriction digest has been developed in our laboratory. These enzymes produce informative, non-identical 
cohesive ends which can be selectively modified by ligation to individual synthetic oligodeoxyribonucleotides with the 
corresponding complementary ends. In this way, polymerase chain reaction and sequencing primer sites and labels can 
be introduced specifically into a single fragment in a total genomic digest. Known and unknown fragments from genomes 
of the complexity of Escherichia coli can be isolated directly in sequenceable form without the necessity of synthesizing 
unique primers. Human DNA has also been accessed in this way. Problems intrinsic to cloning (selective fragment loss, 
mutation and sequence rearrangement) are avoided. Systematic characterization of DNA fragments by their cohesive 
ends and length provides tremendous power and flexibility for analysis of any DNA molecule without specific clones, 
probes or libraries. We report proof of principle of this remarkable system and indicate potential applications in DNA 
sequence tagged site and restriction mapping, sequencing, restriction-fragment-length polymorphism analysis and DNA 
diagnostics. 



INTRODUCTION 

Restriction fragments produced by class-IIS (Szybalski 
et al., 1991 ) and interrupted palindrome ( IP) (Burger and 
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Schinzel, 1983; Quiang and Schildkraut, 1984) restriction 
enzymes have non-identical end sequences. These infor- 
mative ends can be ligated to synthetic DNA indexers to 
permit unique characterization of each fragment. Class- 
IIS enzymes cut a constant distance from their bind'ng 
sites, and IP cut an internal DNA sequence which is not 
specified by the binding site (Fig. 1), so the overhanging 
sequences generated at any one cut site are invariant, but 
not predicted from the binding site sequence. Szybalski 
et al. (1991) reviewed some of the features and applica- 
tions for class-IIS enzymes. 

The specificity of the DNA ligase reaction is used to 
attach complementary indexers to DNA fragments. 
Under routine conditions ligation is specific for comple- 
mentary cohesive ends (although forcing conditions can 
reduce specificity; Tsiapallis and Narang. 1970; 
Sgaramella and Khorana, 1972; Wiaderkiewicz and 
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Fig. I. Class- IIS and IP restriction endonuclease binding and cleavage 
sites. The class-US enzyme Fok\ binds at the 5-GGATG sequence and 
cuts 9 and 13 nt away to generate 4-nt 5'-cohesive ends. The IP enzyme 
S/il binds at the 5 -GGCCNNNNNGGCC sequence and cuts between 
the halves of the recognition sequence to generate 3-nt 3 -cohesive ends. 



Ruiz-Carrillo, l98^PVe have used ligation to import 
selectively primer-binding sites or other structural fea- 
tures onto DNA target fragments. DNA fragments with 
known primer-binding sequences attached to both 
ends can be PCR amplified (Saiki et ah, 1985; Kinzler 
and Vogelstein, 1989). Combinations of an internal, 
specific PCR primer and one external ligated adapter 
have also been used in this way (Patel et aL 1991; 
Smith, 1992). 

When specific indexers have been attached to these 
end sequences, there is no necessity to know any part of 
the- fragment sequence for PCR or sequencing. We de- 
monstrate direct isolation of a variety of fragments from 
digests of X, £. coli and human DNA by ligation of 
indexers followed by PCR. Indexers allow the selective 
definition and amplification of individual DNA frag- 
ments without cloning, opening up a powerful alterna- 
tive route for analysis and sequencing of any DNA 
fragment. 



TABLE I 

Class-NS and IP restriction enzymes which generate digestion fragments with informative cohesive ends 



Enzyme Recognition sequence - End length 6 Fragment length 0 Fragment classes* 1 



5'«Protruding cohesive ends: 



Bhsl 


GAAGACN<2/6> 


4 


2024 


32896 


Bbvi 


GCAGCN<8/12) 


4 


512 


32896 


Bsal 


GGTCTCN(l/5) 


4 


2024 


32896 


BsmA 1 


GTCTCN(l/5) 


4 


512 


32896 


Bs P M\ 


ACCTGCN<4/8) 


4 


2024 


32896 


Earl 


CTCTTCNU/4) 


3 


2024 


2080 


Ecolll 


GGTCTCNI 1/4) 


3 


2024 


2080 


£sp3I 


CGTCTCNU/5) 


4 


2024 


32896 


Fold 


GGATGN<9/13) 


4 


512 


32896 




GACGCNI5/10) 


5 


512 


524800 


Kspblll 


CTCTCNjl/4) 


3 


512 


2080 


Sapl 


GCTCTTCN(i/4) 


3 


8196 


2080 


S/aNI 


GCATCN(5/9) 


4 


512 


32896 


Stsl 


GGATGN( 10/14^ 


4 


512 


32896 


3'-Protruding cohesive ends: 








ApaB\5 


GCANNNNN/TGC 


5 


4096 


524800 


Bgt\ 


GCCNNNN/NGGC 


3 


4096 


2080 


BsiXl 


CCANNNNN/NTGG 


4 


4096 


32896 


Oral II 


CACNNN/GTG 


3 


4096 


2080 


pjtm 


CCANNNN/NTGG 


3 


4096 


2080 


RleAl 


CCCACAN(12/9) 


3 


2024 


2080 


SJi\ 


GGCCNNNN/NGGCC 


3 


65 536 


2080 



* The recognition sequence and position of cleavage site of each restriction enzyme are shown ( Roberts and Macelis. 1992). N: any base; (x/y): number 
of nt between recognition site and cleavage site on each strand of the DNA. 
b Length of the cohesive end (nt) generated by enzyme cleavage. 

c Estimated assuming random base distribution, average fragment length (bp) will be 4*, where x is the number of nt in the recognition sequence. 
d The number of permutations of end sequences taken in pairs; for example. (256 x( 255/2)) + 256 = 32 896 fragment classes are possible for fragments 
with 4-nt cohesive ends (256 = 4 4 ); the added 256 represent the classes of fragments with identical ends. 
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RESULTS AND DISCUSSION 

(a) Class-US and IP restrict! n enzymes 

Class-IIS and IP enzyme digests are suitable for index- 
ing because they produce non-identical end sequences. 
There are 4* such end sequences, where x is the length 
of the overhang produced by digestion. These overhang- 
ing ends are the targets for indexer ligation. Table I lists 
class-US and IP enzymes of interest for indexing. Our 
initial studies have made use of two of these enzymes: 
Fokl is a class-IIS enzyme that cuts DNA every 512 bp, 
on average, and generates 4-nt 5'-cohesive ends; Sjil is 
an IP enzyme that cuts rarely (about every 65 kb), gener- 
ating 3-nt 3'-cohesive ends (Fig. 1). There are 4 4 = 256 
possible ends generated by Fokl digestion and 4 3 = 64 
possible ends generated by Sjil digestion. 

(b) Indexers 

Indexers consist of an individually synthesized indexer 
strand annealed to a complementary common sequence 
(Fig. 2). Each indexer has a specific cohesive end se- 
quence. The 64 different possible 3-nt ends generated by 
Sjil require 64 indexers for complete access; the 256 
different possible 4-nt Fokhcnd sequences require 256 
indexers. 



Fo/d indexers 



24-nt indexer strand 



5'-P- NNNNTTGTCTTCGlCATC C T G T A C C -0H-3' 
3' HO-A AC A G AAG C iG T A G g j A C AT G G -OH-5' 

20-nt common sequence/primer 



SfA indexers 



23-nt indexer strand/primer 



5'HO-A AGTGTAGAGATGTA:GGCCTNN N-OH-3' 
3'-H0-T TCACATCT CTACATCCGG A-P-5' 



20-nt common sequence 

Fig. 2. Indexer structure. Fokl indexers have a 20-nt common sequence 
<#1504) annealed lo one of 256 individually synthesized 24-nt comple- 
mentary indexer strands with 4-nt 5 -cohesive ends. #1504 is the univer- 
sal PCR primer for FoAI-indexed fragments. Sjil indexers have a 20-nt 
common sequence t#3030-P) annealed to one of 64 individually synthe- 
sized 23-nt complementary indexer strands with 3-ni 3'-cohesive ends. 
The S/il-indexer strand is the PCR primer. Oligos were synthesized on 
a Milligen Biosearch DNA synthesizer and purified by TLC (Archer 
et a!.. 1985). FoAI-indexer strands and the S/il-common sequence were 
phosphorylated at the 5 -end. The two strands of the indexers were 
annealed by mixing equimolar quantities of an indexer strand with the 
appropriate common sequence at a concentration of 2 uM each in TC 
1 10 mM Tris HCl pH 7.8/0.1 mM CDTA pH 8.0), heating to 65 C for 
5 min. and cooling at 1 C 10 s to room temperature using an 
MJ Research PTC-100. Annealed indexers were diluted to 50 nM in 
TC for use. 



(c) Indexing known Fokl DNA fragments 

To test indexing we ligated indexers to increasingly 
complex DNA digestions. The pBR322 plasrnid has 12 
Fofcl-generated fragments (Watson, 1988). A 288-bp Fokl 
fragment with an internal Pstl site was identified between 
3513 and 3800 bp in the pBR322 sequence. The 
5'-overhanging ends on this Fokl fragment are 5'-CTTT 
and S'-TAAG. Indexers with complementary ends 
(5'-pAAAG, 5-pCTTA) were ligated to a pBR322 Fokl 
digest. The ligated DNA was amplified by PCR using the 
primer #1504, and the products analyzed by agarose-gel 
electrophoresis. There were no PCR products from unli- 
gated DNA or DNA digests that had been indexed with 
either the 5'-pAAAG or the 5'-pCTTA indexer alone. A 
PCR product was observed only when both indexers were 
present in the ligation. Its size was consistent with the 
predicted length of 288 bp plus 40 bp contributed by the 
indexers. Digestion of this PCR product with Pstl gave 
rise to the expected cleavage products (about 215 and 

120 bp; Fig. 3). 

Bacteriophage X has 150 Fokl sites (Sanger et al., 1982). 
We synthesized a pair of indexers with 5'-pATAT and 
5'-pAACG cohesive ends to target a 315-bp Fokl frag- 
ment from the X genome (between 11 171 and 11485 bp). 
The indexers were ligated to a X Fokl digest, and the 
ligated DNA was amplified using the primer #1504. 
Again, unless both indexers were used in the ligation, no 
PCR products were observed. When both indexers were 
present, a PCR product of the predicted size (355 bp) was 
found. Digestion of this product with Hhal gave a cleav- 
age pattern consistent with that predicted from the se- 
quence (Fig. 4), and sequence analysis confirmed the 
identity of the fragment. This fragment was selected to 
demonstrate indexing with a self-complementary indexer 
end. Similar results confirming the specificity of indexing 
in genomes of increasing complexity have been obtained 
(A.C.Y. Chang and H.L. Heyneker, personal 
communication). 

(d) Indexing unknown Fokl fragments by random access 

There are about 9000 Fokl fragments in £. coli, assum- 
ing a random distribution of DNA sequences and a total 
genome of 4.67 x 10 6 bp (Perkins et al., 1992). In £. coli 
we predict that there are about 70 ligation targets for any 
given indexer; at random the other end of some of these 
target fragments will be complementary to other indexers 
being tested. Such fragments, indexed at both ends, will 
be amplification substrates. The complementary Fokl- 
end sequences of PCR products can be deduced from the 
indexers leading to fragment amplification. We estimate 
that the odds of obtaining a fragment with any two given 
ends in the whole E. coli genome is about 27%. The 
indexers synthesized to target pBR322 and X Fokl frag- 
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Fig. J 



Fig. 4 



Fig. 5 



Fig. 3. PCR amplification and enzyme digestion of an indexed pBR322 Fokl fragment. Lanes: 1 and 7, 123-bp ladder DNA molecular weight standard 
(Gibco BRL. Life Technologies. Burlington, Ontario, Canada); 2, no indexers in ligation (see Methods below); 3, 5-pAAAG indexer alone; 4, 5 -pCTTA 
indexer alone; 5, both indexers together giving a 330-bp PCR product; 6, digestion of the product with Pstl (Gibco BRL) giving 215- and 120-bp 
fragments. Methods: pBR322 DNA (NE Biolabs, Beverly, MA, USA) was digested with Fokl (NE Biolabs) at 2 u/ug DNA for 30 min at 20 C. 
extracted with 1/2 vol. CI AA (24:1 ). and ethanol precipitated. DNA was recovered by microcentrifugation at 14000xg for 30 min. washed with 70% 
ethanol. air-dried, and dissolved in TC to 200 ng/pl. DNA ( 100 ng) was ligated for-30 min at 37 "C in reactions containing 4 u T4 DNA ligase (NE 
Biolabs) and 50 fmol of each indexer (as specified for each lane) in a total volume of 20 ul recommended buffer. Reactions were terminated by heating 
to 95 C for 5 min. 10 ng of DNA from each ligation were amplified by PCR in an MJ Research PTC- 100 using the following program: ( 1 ) 95'C 5 
min; (2) 80 C. hold during addition of 2 u AmpliTaq DNA polymerase ( Perk in Elmer. Nepean, Ontario, Canada); (3) 94C 1 min 15 s; (4) 55 C, 30 
s; (5) 72 C. 1 min 30 s; (6) cycle to step 3. 29 times; (7) 72 C, 5 min; (8) 15 C hold. PCR reactions contained 1 mM MgCK and 40pmol primer 
#1504 in 50 ul of the recommended buffer, covered by 50 ul light mineral oil (Sigma, St. Louis. MO, USA). PCR reactions containing DNA fragments 
were extracted with 100 ul C1AA. ethanol- precipitated and resuspended in TC. A sample was then digested with 5 u Ps/I at 37 C for I h in the 
recommended buffer. Amplification products were electrophoresed at 60 V for 1 h on 1.8% agarose gels in TPE buffer containing 500 ng ml 
ethidium bromide. 

Fig. 4. PCR amplification and enzyme digestion of an indexed k Fok\ fragment. Lanes: 1 and 7, 123-bp ladder DNA molecular weight standard; 2. 
no indexers in ligation: 3. 5 -pATAT indexer alone; 4, 5 -pAACG indexer alone; 5, both indexers together giving a 355-bp PCR product; 6, fragment 
digested with Hhal (NE Biolabs) giving 295- and 60-bp fragments. X DNA (NE Biolabs) was digested with Fokl, ligated to indexers, and PCR 
amplified as described in Fig. 3. PCR reactions containing DNA fragments were digested with 7.5 u Hhal at 37°C for 1 h in the recommended buffer 
and electrophoresed as described in Fig. 3. 

Fig. 5. PCR amplification of indexed Fok\ fragments from whole genomic digests of E. coli B. Lanes: ] and 13, 123-bp ladder DNA molecular weight 
standard: 2, no indexers: 3. 5'-pAAAG indexer alone: 4, 5'-pCTTA indexer alone: 5, 5'-pATAT indexer alone: 6. 5'-pAACG indexer alone: 7, 
5 -pAAAG + 5 -pCTTA indexers; 8, 5 -pAAAG + 5 -pATAT indexers; 9. 5-pAAAG + 5 -pAACG indexers; 10, 5 -pCTTA + 5-pATAT indexers; II. 
5'-pCTTA + 5'-pAACG indexers; 12. 5-pATAT + 5 -pAACG indexers. E. coli B DNA (Sigma) was digested with Fokl ligated to indexers. and PCR 
amplified and electrophoresed as described in Fig. 3. 



ments were ligated to Fo/c I -digested E. coli B DNA to 
see if fragments could be indexed and amplified from such 
a complex fragment pool. Fig. 5 shows the results of am- 
plifying E. coli B Fofcl-digested DNA indexed with the 
ten single and pair-wise combinations of the four index- 
ers. Two unique fragments were obtained, present only 
when both the 5'-pCTTA and 5'-pATAT indexers are 
used. The fragment-end sequences must, therefore, be 



5'-TAAG and 5-ATAT for both fragments. These results 
are evidence that there is a good chance of indexing a 
fragment entirely at random in a whole genomic digest 
of £. coli with any given indexer or pair of indexers. 

Recall that the two cohesive-end sequences of a DNA 
fragment are independent. Using both end sequences, 
32 896 distinct sets of Fokl fragments can be defined. In 
E. coli most pairs of indexers will not index an amplifiable 
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DNA fragment, as only 9000 fragmen expected vs. 
32896 unique pairs of ind x rs. Poisson statistics predict 
that about 10% of the defined subsets will contain more 
than one fragment. These sets of fragments with the same 
end sequences can be separated by length. 

Under the conditions of ligation used, the extent of 
joining of indexers to any of the twelve possible single bp 
mismatched ends is not apparent. Even a limited amount 
of such 'misligated' product wbuld be amplifiable by 
PCR, resulting in multiple product bands for every pair 
of indexers. Preliminary experiments to measure the rate 
of ligation at mispaired ends indicated that this rate is 
less than 10" 3 of the correct ligation rate (unpublished 
data). 

(e) Indexing DNA fragments with dissimilar cohesive ends 

To see if different types of cohesive-end sequences can 
be accessed simultaneously, we indexed Fokl-Sfil frag- 
ments in the SV40 genome. SV40 contains a single Sfil 
site (Fiers et al., 1978). The cohesive ends generated by 
Sfil were identified, as were the cohesive ends at the 
flanking Fokl sites f Fig. 6). Complementary indexers 
were synthesized to access the individual Left hand and 
Right hand fragments at this Sfil site. Fig. 7 shows the 
results of indexing and amplifying the SV40 Fokl-Sfil 
fragments. Amplification of the target fragments required 
both the Fokl and Sfil primers; amplification with either 
primer alone did not give rise to a PCR product. 

We have proposed using indexed fragments centered 
on Sfil sites in STS mapping (Unrau et al., 1990). The 
plsB gene in E. coli contains an Sfil site with adjacent 
Fokl sites located 1436 bp and 472 bp away. Fig. 8 shows 
the indexed amplification of these two fragments. 
Sequence analysis of the smaller fragment confirmed its 
identity as the predicted fragment of the plsB gene. This 
result also demonstrated that sequenceable templates can 
be generated directly from genomic DNA by indexed 
amplification. 

(f ) Indexing human genomic DNA 

We have tested DNA indexing in human genomic 
DNA. The frequency of cleavage of human DNA by Notl 
and Sfil suggested that a number of Notl-Sfil fragments 
would be short enough to permit PCR amplification. 
Using one indexer complementary to the Notl end 
(5-HOGGCC) and specific indexers complementary to 
possible Sfil ends we amplified six fragments (Fig. 9). 
Preliminary sequence data suggests that these products 
are specific and further characterization is in progress to 
confirm their identity. Religation of Notl ends and ampli- 
fication of the few short Notl-Notl fragments were pre- 
vented by using an excess of the 5'-OH indexer. 



fl\ 



Fokl 



/ 




Fokl 




SV40 



(1 1 Fold sites) 




\ 



Fok\ 



Sfil 



Fok\ 



|GCAT 


L 


AGG 


R 








TCC 




TCCC | 



GCAT 


L 


AGG ( 




CGTA 




TCC 


— 



ATGC FoJd Indexer 



CCT SfA indexer 



AGG 


R 


AGGG 




TCC 




TCCC 





AGG Sffl indexer 



AGGG FoM Indexer 



Fig. 6. §V40 Fokl-Sfil sites targeted for indexing. Sfil cuts the circular 
SV40 genome at a single site (5238 bp, Fiers et at. 1978) producing 
AGG-3' and CCT-3' cohesive ends. Cleavage of the adjaceni Fokl sites 
(at 5049 bp and 97 bp) generates 5-pCCCT and 5-pGCAT cohesive 
ends. The two Fokl-Sfil fragments can be indexed using CCT-3' and 
5 -pATGC indexers, and AGG-3' and 5'-pAGGG indexers. shown 
joined to the target fragments by ligation. L and R refer to the left and 
right sides of the Sfil site, respectively. 



(g) Conclusions 

(/ ) We describe a non-cloning method for defining, am- 
plifying and isolating specific DNA fragments from com- 
plex genomic digests. This method depends upon the 
selectivity of ligation to attach indexers to unknown 
DNA fragments, importing primer-binding sites or other 
structural features which may be useful in fragment 
manipulation, including reporter or affinity groups. 

(2) The biological variables which affect cloning do not 
aflFect indexing technology. 

(3) The data demonstrate the specific, selective ligation 
of indexers to complementary end sequences in DNA di- 
gests. Indexers import a primer-binding site to the non- 
identical cohesive-end sequences of fragments produced 
by class-IIS and IP enzyme digestion. On PCR, such 
indexed fragments give rise to amplification products that 
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Fig. 9 



Fig. 7. PCR amplification or indexed FokUSfil fragments of SV40 Lanes: 1 and 8, 123-bp ladder DNA molecular weight standard: 2. ligation (i) 
amplified with PCR primer #1504; 3, ligation <i) amplified with PCR primer CCT-3' S/il-indexer strand: 4, ligation (i) amplified with PCR primers 
#1504 and CCT-3' S/il-indexer strand; 5, ligation Hi) amplified with PCR primer #1504; 6, ligation amplified with PCR primer AGG-3' Sftl- 
indexer strand; 7, ligation Ui) amplified with PCR primers #1504 and AGG-3' S/il-indexer strand. Methods: SV40 DNA (Gibco BRL) was digested 
with Fokl (see Fig. 3), followed by two digestions with 2 u Sfil (NE BiolabsVug DNA at 50X overnight under oil. The digest was extracted with 
CI A A, ethanol precipitated and dissolved to 200 ng/jil in TC. Conditions for ligations and PCR amplification were as described in Fig. 3. Ligations 
contained 50 fmol of the following indexers: (i) 5 -pATGC Fokl indexer and CCT-3' Sfil indexer (ii) 5-pAGGG Fokl indexer and AGG-3' Sfil indexer. 
PCR reactions were analyzed by electrophoresis as described in Fig. 3. 

Fig. 8. Amplification of indexed £. coli pkB gene (Gen Bank accession No. KOOI27) Fokl-Sfil fragments. Lanes: 1 and 4, 123-bp ladder DNA molecular 
weight standard; 2, ligation (i) amplified with PCR primer #1504 and PCR primer GGC-3' S^I-indexer strand; 3. ligation (ii) amplified with PCR 
primers #1504 and GCC-3' S^I-indexer strand. £. coli B DNA (Sigma) was digested with Fokl (see Fig. 3) followed by digestion twice with Sfil (as 
described in Fig. 7). The digest was extracted with CI A A, ethanol precipitated and dissolved to 200 ng/ul in TC. Conditions for ligations and PCR 
amplification were as described in Fig. 3. Ligations contained 50 fmol of the following indexers: ii) 5-pTCCG Fokl indexer and GGC-3' Sfil indexer; 
(ii) 5 -pCGCG Fokl indexer and GCC-3' Sfil indexer. PCR reactions were analyzed by electrophoresis as described in Fig. 3. 

Fig. 9. Amplification of indexed human NotUSfil fragments. Lanes: 1 and 8. 123-bp ladder DNA molecular weight standard; 2. ligation (i) amplified 
with primer #1504 and CCA-3' S/il-indexer strand: 3. ligation (ii) amplified with primer #1504 and CCT-3' S/fl-indexer strand; 4, ligation (Hi) amplified 
with primer #1504 and TAC-3' S/il-indexer strand: 5. ligation (iv) amplified with primer #1504 and TAG-3' S/il-indexer strand; 6, ligation (v) ampli- 
fied with primer #1504 and CGG-3' 5/iI-indexer strand; 7, ligation (vi) amplified with primer #1504 and CTG-3' S/il-indexer strand. Human DNA 
[isolated from cultured LCL cells essentially as described by Jeanpierre ( 1987)] was twice digested with Sfil as indicated in Fig. 7 and with Notl (NE 
Biolabs) 2 u/pg for 2 h at 37X and recovered as indicated in Fig. 3. Ligation and PCR conditions (40 cycles) are as described in Fig. 3. Ligations 
contained 100 ng digested DNA. 50 fmol of the 5 -HO-GGCC Fokl indexer (complementary to the Notl end) and 50 fmol of one of the following 
Sfil indexers: (/> CCA-3'; [ii) CCT-3'; [Hi) TAC-3'; (/r) TAG-3'; [v) CGG-3'; (vi) CTG-3'. PCR reactions were analyzed by electrophoresis as described 
in Fig. 3. 



correspond in length and restriction enzyme cleavage 
sites to those predicted from whole sequence analysis. 

(4) Non-specific amplification of artefacts was not ob- 
served with the primers used. 

(5) Ligation reactions can attach 4-nt 5'-protruding, 
or 3-nt 3'-protruding indexers simultaneously to frag- 
ments with different sorts of ends, including self- 
complementary ends. 

(6) A single primer suffices for PCR of indexed Fokl 
fragments, while both Fokl and Sfil primers are required 



for PCR of fragments indexed with both Fokl and Sfil 
indexers. This proves that we are specifically importing 
primer-binding sites to indexed fragments. 

(7) In whole genomic digests of £. coli B a limited set 
of fragments amplify after ligation to all pairwise combi- 
nations of four indexers. Successful indexing of Fokl di- 
gests of E. coli genomic DNA shows that selective ligation 
has the specificity to subdivide complex mixtures of DNA 
fragments in an orderly way. 

(8) Because the number of possible cohesive ends is 
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limited, this method could be systematically applied both 
to fragment isolation and to access of information about 
each fragment. 

(9) Because DNA fragments can be directly indexed, 
amplified and sequenced, it is reasonable to expect that 
indexing could be applied to the mapping and sequencing 
of YACs ( Burke et al., 1987), isolated yeast chromosomes, 
whole bacterial genomes, and human genomic DNA. 
Indexing applied to detectiop of mutations at the 
cohesive ends of fragments would result in a variant 
of RFLP analysis called Restrict ion- Fragment End 
Polymorphisms (RFEPs) potentially useful for DNA 
diagnostic studies. 
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